Knock-Knock: Acoustic object recognition by using stacked denoising autoencoders

نویسندگان

Shan Luo

Leqi Zhu

Kaspar Althoefer

Hongbin Liu

چکیده

This paper presents a successful application of deep learning for object recognition based on acoustic data. The shortcomings of previously employed approaches where handcrafted features describing the acoustic data are being used, include limiting the capability of the found representation to be widely applicable and facing the risk of capturing only insignificant characteristics for a task. In contrast, there is no need to define the feature representation format when using multilayer/deep learning architecture methods: features can be learned from raw sensor data without defining discriminative characteristics a-priori. In this paper, stacked denoising autoencoders are applied to train a deep learning model. Knocking each object in our test set 120 times with a marker pen to obtain the auditory data, thirty different objects were successfully classified in our experiment and each object was knocked 120 times by a marker pen to obtain the auditory data. By employing the proposed deep learning framework, a high accuracy of 91.50% was achieved. A traditional method using handcrafted features with a shallow classifier was taken as a benchmark and the attained recognition rate was only 58.22%. Interestingly, a recognition rate of 82.00% was achieved when using a shallow classifier with raw acoustic data as input. In addition, we could show that the time taken to classify one object using deep learning was far less (by a factor of more than 6) than utilizing the traditional method. It was also explored how different model parameters in our deep architecture affect the recognition performance.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Method for Knock Signal Denoising in Spark Ignition Engine

One of the factors that affects the efficiency and lifetime of spark ignited internal combustion engine is “knock”. Knock sensor is a commonly used to detect this phenomenon. However, noise, limits detection accuracy of this sensor. In this study, Empirical Mode Decomposition (EMD) method is introduced as a fully adaptive signal-based analysis. Then, based on weighting decomposition...

متن کامل

Decoding Stacked Denoising Autoencoders

Data representation in a stacked denoising autoencoder is investigated. Decoding is a simple technique for translating a stacked denoising autoencoder into a composition of denoising autoencoders in the ground space. In the infinitesimal limit, a composition of denoising autoencoders is reduced to a continuous denoising autoencoder, which is rich in analytic properties and geometric interpretat...

متن کامل

Improved Knock Detection Method Based on New Time-Frequency Analysis In Spark Ignition Turbocharged Engine

Premature combustion that affects outputs, thermal efficiencies and lifetimes of internal combustion engine is called “knock effect”. However knock signal detection based on acoustic sensor is a challenging task due to existing of noise in the same frequency spectrum. Experimental results revealed that vibration signals, generated from knock, has certain frequencies related to vibration resonan...

متن کامل

Mid-level Features for Audio Chord Estimation using Stacked Denoising Autoencoders

Deep neural networks composed of several pre-trained layers have been successfully applied to various tasks related to audio processing. Stacked denoising autoencoders represent one type of such networks. They are discussed in this paper in application to audio feature extraction for audio chord estimation task. The features obtained from audio spectrogram with the help of autoencoders can be u...

متن کامل

Marginalized Stacked Denoising Autoencoders

Stacked Denoising Autoencoders (SDAs) [4] have been used successfully in many learning scenarios and application domains. In short, denoising autoencoders (DAs) train one-layer neural networks to reconstruct input data from partial random corruption. The denoisers are then stacked into deep learning architectures where the weights are fine-tuned with back-propagation. Alternatively, the outputs...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Neurocomputing

دوره 267 شماره

صفحات -

تاریخ انتشار 2017

Knock-Knock: Acoustic object recognition by using stacked denoising autoencoders

نویسندگان

چکیده

منابع مشابه

An Efficient Method for Knock Signal Denoising in Spark Ignition Engine

Decoding Stacked Denoising Autoencoders

Improved Knock Detection Method Based on New Time-Frequency Analysis In Spark Ignition Turbocharged Engine

Mid-level Features for Audio Chord Estimation using Stacked Denoising Autoencoders

Marginalized Stacked Denoising Autoencoders

عنوان ژورنال:

اشتراک گذاری